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I. INTRODUCTION 



Observables pertaining to the configuration of a quantum system with n degrees of freedom are operators 
Qi, Q2, ■ ■ ■ , Qn which, depending on the system, may or may not commute. Our main concern is with the 
case in which the configuration variables are amplitudes of certain field modes. 

For boson fields, these configuration observables do commute, and the state space H can be taken as 
the space of all complex square integrable functions on their joint spectrum. This is the Schrodinger g-space 
representation, and the fact that in it the state space is a function space, and not just an abstract Hilbert 
space, is very helpful in the analysis of such systems. As one example, it sometimes turns out that physically 
interesting operators preserve the cone of positive functions, and this opens the way to the application of 
the Perron-Frobenius theorem in the study of ground states of such systems. 

For fermion fields, the configuration observables do not commute, and this simple g-space representa- 
tion is not available. However, the non-commutative integration theory of Irving Segal [Se53] permits the 
construction of a suitable substitute, and in fact it was created with such a purpose in view. This approach 
to the study of fermion systems has been extensively developed by Gross [Gr72] who, among other things, 
proved a version of the Perron-Frobenius theorem adapted to the setting and applied it to prove existence 
and uniqueness of ground states for certain fermion quantum field models. 

The main estimate which enabled Gross to apply his Perron-Frobenius type theorem to fermion fields 
was a hypercontractivity estimate for the fermion oscillator semigroup. Corresponding hypercontractivity 
estimates for boson fields had been introduced earlier by Nelson [Ne66] who later obtained the optimal such 
bound for bosons [Ne73]. 

Our main result, Theorem 4 below, is the corresponding optimal hypercontractivity bound for fermions. 
Before stating this theorem, we describe its mathematical and physical contexts in some detail because it 
cannot even be formulated naturally in the conventional Fock space language. (Of course its perturbation 
theoretic consequences can be expressed quite naturally in the usual language.) 

The state space for a system of n fermion degrees of freedom is conventionally realized as the Fock space 

^ = ©" =0 ((<n Aj ) (i-i) 

The basic "free Hamiltonian" on this space is the fermion number operator 

n 

H = J2cjCj (1.2) 

where Cj and c* are the usual fermion annihilation and creation operators acting on T . The fermion oscillator 
semigroup is the semigroup of operators exp(— tHo) that it generates. 

For n boson degrees of freedom, the state space may be realized either as the boson Fock space, or 
as L 2 (JR n , (2n) n / 2 e~ x / 2 dx). The natural isomorphism between these spaces was pointed out by Segal, 
and the latter may be regarded as the Schrodinger g-space realization. The boson oscillator semigroup is 
the semigroup generated by the boson number operator. Though we have just defined it in Fock space 
terms, it may also be considered as an operator semigroup on L 2 (JR n , (2n) n / 2 e~ x / 2 dx). In this setting, 
it is hypercontractive; i.e., for any finite p greater than 2, there is t p suffcicntly large, for which the 
semigroup is a contraction from L 2 {WC, (2ir) n / 2 e- x ^ 2 dx) to U>(]R n , {2Tt) n / 2 er x ' '/ 2 dx) for all t > t p . Since 
t p is independent of n, this result proved useful in treating perturbation theoretic problems for boson fields. 
This hypercontractivity inequality cannot be formulated naturally in the Fock space setting because no 
natural notion of u L pv can be introduced there. The LP(JR n , (2ir) n / 2 e - x l 2 dx) setting is essential. 

In order to to pass from the Fock space description for a system of n fermion degrees of freedom to a 
non-commutative analog of the Schrodinger g-space description (in which we have non-commutative analogs 
of LP norms), we introduce configuration observables Qj = Cj + c*, and let C(Q) be the algebra with unit 
which they generate. This is an operator algebra on the 2™-dimensional Hilbert space T . As we explain in 
Section II, it is a Clifford algebra naturally associated to d™ with its usual inner product. 
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What follows here shall be explained in more detail in Sections II— IV, but the key fact for our present 
considerations is that the 2™ distinct monomials in the Qj are a basis for C(Q) as a vector space. Thus, any 
A in C(Q) can be uniquely written as 

A = al + ajQj + y~] a>j,kQjQk H h ai,..., n Qi ■ ■ ■ Qn • (1-3) 

It is not hard to see that any vector v in T can be written as v = AQ for some unique A G C(Q) acting on 
the vacuum Q. The operator exp(— tHo) acts on such vectors v. Thus, exp(~tHo)v = Bfl for some unique 
B E C(Q) (which depends on t and A, of course). The map Pt : A n 5 is clearly linear, and a simple 
calculation (elucidated in Section IV) shows that 

P t A = al + e _t a iQi + e ~ 2 ' a i,kQjQk + ••• + e"" t a 1 ,... i „Q 1 ■ • • Q n . (1.4) 

Instead of considering Fock space, T ', and the operator exp(— tHo), Segal's idea is to consider C(Q) and 
the linear operator P t that acts on C(Q) as above. Following Segal, we can construct L p norms on C(Q), 
the most important case being p = 2, in terms of which one can do quantum mechanics without mentioning 
Fock space. 

To construct the norms, define the linear functional r : C(Q) — > (E by 

T(A) = (n,An)j: . (i.5) 

It turns out, as we explain in Section III that t(A) is, for A e C(<2), nothing other than the usual normalized 
trace on T applied to A. 

The basic ingredients of a non-commutative integration theory are an algebra of operators such as C(Q), 
and a linear functional on it such as r. Norms analogous to the LP norms of commutative integration theory 
can now be introduced by 

\\A\\ p =(t(A*A)p/ 2 )) 1/p for l<p<oo ; 

H^Hoo denotes the operator norm of A. Let C P (Q) denote C(Q) equipped with the corresponding norm. 

Having expanded our horizons beyond Hilbert space, we can ask for bounds between ||P t j4|| g and \\A\\ p 
for different q, p and t Our main result, Theorem 4, is the optimal fermion hypercontractivity inequality; 
i.e. for all 1 < p < q < oo and all A in C P (Q), 

\\P t A\\ q <\\A\\ p when e~ 2t < , (1.6) 

and the t saturating the inequality on the right is the smallest for which the inequality on the left always 
holds. We prove this for n degrees of freedom with n an arbitrary finite integer. Since the estimate is 
independent of n, a theorem of Gross [Gr72] implies that it holds as well with infinitely many degrees of 
freedom. 

The result (1.6) was conjectured by Gross [Gr75] who proved it [Gr72] in the special case p = 2, q = 4. 
The cases p = 2, q = 2m where m is an integer were proved by Lindsay and Meyer [LiMe] following earlier 
work by Lindsay [Lin] on the case p = 2, q = 2 m . Since P t is self adjoint, duality yields a corresponding 
family of results in which q = 2. Until now all other cases (except when n = 1 or 2) had remained open. 
The optimal relation between t, p and q found here for fermion hypercontractivity is the same as that found 
by Nelson [Ne73] for boson hypercontractivity. 

By now there are many proofs of Nelson's inequality. Neveu's elegant proof [Nev], like Nelson's original 
proof, is based on probabilistic methods. Proofs based on geometric methods have been given by Carlen and 
Loss [CL90] and by Lieb [Li90] who considers generalizations in which the Mehler kernel (the kernel for the 
boson oscillator semigroup) is replaced by an arbitrary Gaussian kernel. More references can be found in 
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the bibliography to Gross's article [Gr89]. However, none of the existing approaches to the boson problem 
has been found to solve the fermion problem. 

As a corollary to the optimal fermion hypercontractivity inequality we obtain the optimal fermion 
logarithmic Sobolev inequality: 

r(\A\ 2 In \A\ 2 ) - {\\A\\l In || A\\j) < 2r(A*H A) (1.7) 

where \A\ = {A* A) 1 ' 2 . 

This inequality was conjectured by Gross, and proved by him in a weaker form with the constant on 
the right increased by a factor of In 3. In studying perturbations of H by multiplication operators V, this 
inequality plays the same role as does the usual Sobolev inequality in studying perturbations of — by 
multiplication operators V on L 2 (JR n ,d n x) [Fe69]. (The multiplication operator associated to a self-adjoint 
element V of C(Q) is defined to be the average of left and right multiplication by V, and is denoted here by 
the same symbol V.) Again, in the standard Fock space setting, there is no natural way to formulate such 
an a-priori regularity inequality, or even to introduce the notion of a multiplication operator. 

This paper is organized as follows: In Section II we study the structure of C(Q) for finite n. It is 
actually simpler, as well as technically advantageous, to consider it as a subalgebra of the algebra C(JC) 
generated by the identity, the configuration observables Q\, . . . , Q n and their conjugate momenta P\ . . . , P n . 
Since they both turn out to be certain Clifford algebras, their structure has been worked out long ago with 
the representation theory of the orthogonal groups. Thus, this section contains no new result but simply 
introduces notation and prepares the way for what follows. Of particular use are an explicit spin- chain 
representation of the operator algebra C(JC), and the Jordan- Wigner transform identifying it with the algebra 
generated by n "hard core bosons" . 

Section III concerns properties of the spaces C P {JC) and their norms. The main result here is an optimal 
uniform convexity inequality for C P (JC), 1 < p < 2, which is joint work with Keith Ball. We need only a 
special case of this inequality here, and a proof is provided in an appendix for the reader's convenience. 

Section IV introduces a convenient expression for P t in terms of the conditional expectation ttq of C(JC) 
with respect to C(Q). The main result in this section is an inequality for conditional expectations which 
enables us to prove that 

sup{\\P t A\\ q : \\A\\ p = l} = sup{\\P t A\\ q : A>0 and \\A\\ p = 1} . 

Thus, to establish (1.6) in general, we need only consider positive A. 

In Section V we establish the optimal fermion hypercontractivity bounds and the corresponding optimal 
fermion logarithmic Sobolev inequality. This is done in several steps. First, using results collected in Sections 
II and III we establish that (1.7) holds for 1 < p < 2 and q = 2. At t — this is an equality; differentiating 
it there yields the logarithmic Sobolev inequality (1.7). Gross showed that (1.6) would follow from (1.7), if 
it were true (as we show here), for all self adjoint A. His result rests on a deep inequality he established 
for positive operators. Since it is not in general true that ||PtA|| q < ||Pt|.A|||g, as is trivially true in the 
commutatative case, Gross's result does not allow us to conclude (1.6) for general (i.e. non-self adjoint) A 
from (1.7). The results of the Section IV do, however, allow us to draw this conclusion. 

Finally, in Section VI we show that the same hypercontractivity relation (1.6) holds for a mixed system 
of bosons and fermions. 

Non-commutative probability theory has grown into a substantial branch of analysis with a number of 
physical applications. The mathematical theory is reviewed and developed in [Me85] and [Me86], while other 
sorts of physical applications, besides those discussed here, are treated in [Da76] and [HuPa] for example. 

It is a pleasure to thank Leonard Gross for discussing his results and conjectures with us, and for 
encouraging us to take up the latter. We are indebted to Keith Ball for his collaboration on the subject of 
convexity inequalities that led to Theorem 1 [BCL], which is one of the key ingredients in the present work. 
Thanks are also due to G.-F. Dell' Antonio, A. Jaffe and A. Wightman for useful discussions. 

II. FERMIONS AND THE CLIFFORD ALGEBRA 
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Wc begin by recalling for later use some well known facts about fermions. 

The fundamental observables for a system of n fermion degrees of freedom are configuration operators 
Qi, Qi, ■ ■ ■ , Qn together with their conjugate momenta operators Pi, P 2 , . . . , P n all acting as operators 
on a complex Hilbert space H and satisfying the canonical anticommutation relations: 

QjQk + QkQj = 2S jk , PjP k + P k Pj = 2S jk , P j Q k + Q k P j =0 • (2.1) 

We denote the complex algebra generated by the identity and the configuration observables by C(Q), and 
the complex algebra generated by the identity, the configuration observables and the momentum observables 
all together by C(JC). C(Q) is the object of primary interest; but many aspects of its structure are most 
readily seen within the larger algebra C(/C). 

This algebra can be concretely represented as the algebra of observables for a spin-chain as follows. We 
define the matrices 



I = 



"1 0" 


, u = 


'1 " 


, Q = 


"0 


r 




' f 


, p = 


1 


-1 


1 







-i 



Let H denote the n-fold tensor product of (E 2 with itself: 

H = (E 2 <8> . . . <8> (D 2 , 

V v ' 

n times 

and on Tt define the operators 

Qj = U®---U®Q®I®---®I , Pj = C/(g)---?7(g)P®/®---®/ (2.2) 

where the Q and the P occur in the jth places. 

The operators Qi, . . . , Q n , P\. . . ,P n just defined are easily seen to satisfy the canonical anticommutaion 
relations. Of great use in studying the algebra C(K.) that they generate is the fact that it is also the algebra 
generated by n "hard core boson" degrees of freedom. More explicitly, put 

Qj = I®---I(g)Q<g)I(g)---<g)I , P,=7®---7®P®7®---®7 , (2.3) 

and call the algebra generated by the operators Qi, . . . , Q n , P\ . . . , P n the hard core boson algebra, and 
denote it by C(/C). To see that the two algebras coincide, put 

fe-i 

Uj=I®---I®U®I®---®I and V k = J| Uj . (2.4) 

Then since PjQj = PjQj =iUj, each V k belongs to both the hard core boson algebra and C{JC). Moreover 

Qk = V k Q k and Pfe^VfePfe (2.5) 

with the inverse relation given as well by 

Qk = V k Q k and P k = V k P k . (2.6) 

Thus, Q k ,P k are in C(/C), and Q kj P k are in C(JC). 

What we call the hard core boson algebra was initially introduced by Jordan and Klein [JoKl] as a first 
attempt to implement the Pauli exclusion principle mathematically. The transformation of observables (2.6) 
was discovered by Jordan and Wigner [JoWi] and used by them to write the algebraic relations characterizing 
the algebra in the familiar covariant form (2.1); today it is known as the Jordan-Wigner transform. It has 
been used many times since - for example, in a solution of the two dimensional Ising model by Schultz, 
Mattis and Lieb, whose paper [SML] can be consulted for references to other applications. It is also the key 
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to Brauer and Weyl's treatment [BrWe] of the Clifford algebra on which the exposition in the rest of this 
section is largely based. Following them, we now show that C(JC) is the full matrix algebra on H. 
First, we introduce a basis in H. Let 

ei = (^j and e_i = ^ 
be the standard basis of (E 2 . For each j = 1, . . . , n, let Oj be either 1 or —1. Then the unit vectors 



provide a natural orthonormal basis for H. 

Next introduce the fermion creation and annihilation operators c* = \{Qj — iPj) and Cj = \{Qj + iPj) 
and their hard core boson analogs c* = ^(Qj — iPj) and Cj = ^(Qj+iPj). Now put L{a\, . . . , a n ) — JXj=i Pj 
where Pj = cj if <Tj = 1 and Pj = Cjd* if (jj = —1. Then L{a\, . . . , a n ) e a 1 ,....<j n = ^ where 

^ = (J) ® • • • ® (J) = e-i,-i,...,-i (2.7) 



is a distinguished unit vector in H called the ground state. Also, L(a\, . . . ,<7 n ) annihilates all the other 
basis vectors. Moreover, L*(a\, . . . , (7„)£! = e CTl) ... iCTrl . Thus, 

L*(n, . . . , T„)L(eri, . . . , (Jn)e ai ,...,cr n — e ril ... lTn , 

and this operator annihilates all other basis vectors. Manifestly this operator belongs to C(/C), and hence to 
C(JC) as well, and the 2™ operators of this kind form a basis for the full matrix algebra on H. 

This concrete description of C(JC) in terms of spin-chain obscrvablcs is the most useful for many purposes. 
Still, it is also useful to have a characterization of C{JC) which is less dependent on coordinates; i.e. on the 
choice of fermi configuration observables Q\, . . . , Q n . 

Toward this end, consider the standard n-dimensional Hilbert space (E™ equipped with its standard 
inner product (•, •) and complex conjugation. Let K. denote d™ considered as a real 2n-dimensional Hilbert 
space equipped with the inner product 

{x,y)K = ^{x,y) ■ 

Then complex conjugation on (E™ induces an involutory orthogonal transformation J on JC. Let Q and V 
respectively denote the eigenspaces of J corresponding to the eigenvalues +1 and —1. The bilinear form 
on K. given by ^(x, y) is symplectic so that K is naturally endowed with the structure possessed by the 
classical phase space of a system of n linear degrees of freedom. Q is called the configuration space, V 
the momentum space, and the complex conjugation J is usually identified with time reversal. 

The Clifford algebra C(K-) is characterized up to automorphism as the algebra with unit / such that: 

(i) There is a linear imbedding J : JC — > C(JC), and J{fC) generates C(JC). 

(ii) For all x, y e JC, 

J(x)J(y) + J(y)J(x) = (x,y) !C . (2.8) 

To make contact with our previous concrete description, let {q±, . . . , q n } be an orthonormal basis of (E n 
consisting of purely real vectors. For each j, let pj = iqj. Evidently Q is spanned by {qi, . . . , q n } and V is 
spanned by {p\, . . . ,p n }- Any x e JC can then be written as x = Y^j=i ^iQj + Y^j=i VjPj- Using the notation 
introduced above, put J : JC — > C(JC) by 

n n 

:/;.<•* Y^^.Qj ■ X '/•''/ ■ 

3=1 0=1 

Let {xi, . . . , X2n} be any orthonormal basis of C(JC). Then the monomials 

J(x ai )J(X a2 )---J(x ak ) 
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together with I form a basis for the algebra. It is easy to see that the product of any two such monomials 
is a third. Though the multiplication rule can be simply expressed in terms of certain contraction rules, 
its precise form is not useful to us here. What is useful to observe is that since the right side of (2.8) is 
invariant under orthogonal transformations of JC, the multiplication law of these monomials does not depend 
on the choice of the orthonormal basis. For this reason, any orthogonal transformation R of JC induces an 
automorphism of C(JC) which we shall also denote by R. Indeed, with R : C(JC) — > C(JC) defined by 

R(J{x ai ) • • • J(x a „)) = J{R{x ai ))---J{R{x ak )) , 

R is evidently invertible and by the remarks made just above one easily sees that for all A, B € C(JC), 
R(AB) = R(A)R{B) which is to say that R is an automorphism. 

Finally, we remark that C(1C) is a *-algebra; there is a unique conjugate linear involutory antiautomor- 
phism A \—* A* which is the identity on J(JC). It is given by 

R(j(x ai )---J(x ak ))* = J(x ak )---J(x ai ) . 

Of course, on C(JC) regarded as a matrix algebra, this is just the usual adjoint. Evidently the automorphism 
R of C(JC) induced by an orthogonal transformation R of JC is a ^-automorphism; i.e. R(A*) = (R(A)) . 

The facts that orthogonal transformations of JC induce automorphisms of C(JC), that C(JC) is a full 
matrix algebra, and that all automorphisms of full matrix algebras are inner; i.e. of the form A i— » SAS -1 
for some nonsingular matrix S, are the basis of Brauer and Weyl's treatment of the spin representations of 
the orthogonal groups [BrWe]. We will use the fact that all automorphisms of C(JC) are inner several times 
in what follows. 



III. ANALYSIS ON THE CLIFFORD ALGEBRA 



Let A be a von Neumann algebra of operators on some finite dimensional Hilbert space. By a trace on 
A we shall mean a linear functional r which is positive in the sense that t{A* A) > for all non zero A in A, 
and cyclic in the sense that t(AB) = t(BA) for all A and B in A. Such a functional is evidently unitarily 
invariant in the sense that whenever A and U belong to A, and U is unitary, then t(U*AU) = t(A). Since 
C(JC) is a full matrix algebra, it contains all unitaries. Hence any trace on C(JC) must assign the same value to 
all rank one projections, and thus must be a scalar multiple of the standard trace Tr on the matrix algebra. 
Henceforth, t shall denote this trace normalized by the condition that t(I) = 1 and Tr shall denote the 
standard unnormalized trace. 

In the non-commutative integration theories of Dixmier [Di53] and Segal [Se53] , the trace functional r 
is the non-commutative analog of the functional that assigns to an integrable function its integral. When the 
Hilbert space is infinite dimensional, some further regularity properties are required of r in order to obtain 
a useful analog. Since all of our estimations will be carried out in the finite dimensional setting, wc shall not 
go into this here, but shall simply refer the reader to these original papers as well as the accounts in [Gr72] 
and [Ne74]. 

Norms on C(JC) which are the non commutative analogs of the LP norms can now be introduced; namely 
for 1 < p < oo we put 

\\A\\ p =(t({A*A)^)) 1/p , (3.1) 

and denote the operator norm of A by ||j4||oo. 

C P (JC) shall denote C(JC) equipped with the norm || • || p ; evidently C 2 (JC) is the Hilbert space of 2" x 2™ 
matrices equipped with the Hilbert- Schmidt norm. 

Consider the monomials 

E[ ai ...,a j -I3 1 ,...,fj k ] = Q ai ■ ■■QajPfl! ' ' • Pfi k (3.2) 

where a\ > ■ ■ ■ > aj and (3\ > ■ ■ ■ > /3k and j + k > 0. Evidently 

E [ ai ..., a] ;f3 1 ,...,l3 k ] E [ ai ..., a3 ;l3 1 ,...,f3 k ] = 1 
7 



and thus ||-E : [ai...,a :j ;/3i,...,/3 fe ] Hp = 1 f° r all p. Moreover 

T{E [ai ... iCtj . 01t ... iPk] ) = . (3.3) 

To see this, first consider the case in which j + k is odd. The inversion x i— > — x on JC is orthogonal. Hence 
it induces an automorphism of C(JC), and hence there is an invertible S in C{JC) so that 

-- E '[ai...,aj;ft,...,/3 fc ] = SE[a 1 ...,a j ;0 u -,l3k] S 1 ' 

Then, by using cyclicity of the trace we get the desired result. Next consider the case in which j + k is even 
and, say, j > 0. Then write i?[ai...,c«,-;/3i.---,/3fc] = Qa^X, and note that by (2.1), Q a± X = —XQ ai . Again the 
desired conclusion follows from cyclicity of the trace. 
It is easy to see from this that 

T ( E [a 1 ...,a j ;0 1 ,...,0 k ] E [ 11 ..., lm ;S 1 ,...,S n ]) = ( 3 - 4 ) 

unless the two monomials coincide. Thus, together with the identity, the assemblage of such monomials 
forms an orthonormal basis for C 2 (/C). Finally observe that since Qe_ = e + 

(n,E [ai _ a . ] n) = o (3.5) 

whenever j > 1, and, as indicated, k = 0. It now follows that, restricted to C(Q), 

t(A) = {SI,A£1) (3.6) 

for all A in C(/C). 

Formula (3.6) is very important for us. It permits us to calculate the "physically" relevant quantity 
(fi, AST) in terms of the apparently mathematically simpler quantity t(A). 

Many familiar inequalities for L p norms hold for the C p norms as well [Di53] . This is true in particular 
of the Holder inequality 

\\AB\\ r <\\A\\ p \\B\\ q \= l - + ~ q ■ 

Certain optimal inequalities expressing the uniform convexity properties of the L p norms also hold for 
the C p norms, and this fact constitutes one cornerstone of our analysis. 
The modulus of convexity S p of C P {K) is defined by 

S p (e) = inf {1 -l\\A + B\\ p : \\A\\ p = \\B\\ p = 1 , \\A - B\\ p = e} (3.7) 

for < e < 2. For 1 < p < oo, S p is always positive which means these norms arc uniformly convex. Useful 
geometric information is contained in the rate at which 5 p (e) tends to zero with e. It is known [TJ74] that 
for 2 < p < oo, 5 p (e) ~ e p , but that for 1 < p < 2, 5 p (e) — e 2 . 

An optimal expression of this fact is given by the following theorem which was proved jointly with Keith 
Ball [BCL] : 

Theorem 1: (Optimal 2-uniform convexity for matrices). For all m x m matrices A and B and all 

V f or 1 < P < 2 ; 



(Tr\A + B\ p +Tr\A- B\ p \ 2/p 



\ 2 
For 1 < p < 2, there is equality only when B = 0. 



> (Tr\A\ p f /p + {p-l)(Tr\B\ p f /p . (3i 



This result, which we interpret here as a statement about C p (/C), is proved in the appendix in the special 
case that both A + B and A — B are positive; this is the only case in which we shall use it here, and the proof 
is considerably simpler in this case. The full result is proved in [BCL], in which other geometric inequalities 
for trace norms are proved as well. 

The theorem implies that 

W>^y^) 2 for Kp<2 (3.9) 

as one sees by considering A = (C + D)/2, B = (C-D)/2, \\C\\ P = \\D\\ p = 1 and \\C - D\\ p = e. It is easily 
seen that the constant (p — l)/8 cannot be improved. 

We make our main application of this result in Section V. There we will also need to know that the 
norms on C P (JC) are continuously differentiable away from the origin for 1 < p < oo. This is known [Gr75], 
but a simple proof can be based on inequalities of the form 5 p (e) > K p e r ^ such as we have found above for 
1 < p < 2. This proof, moreover, gives the modulus of continuity of the derivative, and is sketched in the 
appendix as well. Again, these estimates are independent of the dimension and therefore apply to the case 
of infinitely many degrees of freedom. 



IV. CONDITIONAL EXPECTATIONS AND THE FERMION 
OSCILLATOR SEMIGROUP 



We are particularly concerned with the subalgebra C(Q) of C(fC), and the conditional expectation 
[Di53] [Um54] with respect to it shall play a basic role in our investigation. For any A in C(/C), the conditional 
expectation ttq(A) of A with respect to C(Q) is defined to be the unique element of C(Q) such that 
t(B*itq(A)) = t(B*A) for all B in C(Q). Otherwise said, ttq is the orthogonal projection from C 2 (fC) onto 
C 2 (Q). It is well known that the conditional expectation is positivity preserving; a familiar argument shows 
that tt q (A*A) > tt q (A)*it q (A). 

We can use the conditional expectation to give a useful expression for the oscillator semigroup for 
fermion fields. 

Let Rg be the orthogonal transformation of JC given by 

R e (qj) = (cos%j + (sin9) Pj (4.1) 

for each j. Of course Re gives the evolution at time 9 on phase space JC generated by the classical oscil- 
lator Hamiltonian i?(p,q) = Y^j=i \{p] + Qj)- Let Rg denote the automophism of C(K) generated by the 
orthogonal transformation Rg as in the first section. For each t > 0, define 6(t) = arccos(e~*) and define the 
operator P t on C 2 (JC) by 

P t A = TTq O Rg (t) O TTqA . (4.2) 

Note that 7Tq is the natural imbedding of C(Q) into C(JC), and regarded as such, it is a *-automorphism. 
Formula (4.2) is the analog of the familiar expression for the boson oscillator semigroup on L 2 (Q,(2^)-"/ 2 e-" / 2 d», 
i.e. the Mehler semigroup 

P t (6MOB) A(q) = J A^e-tq + (1 - e^) 1 ^ (27r)-"/ 2 e -f 2 / 2 d> . 

Note that since all of the operators on the right in (4.2) are positivity preserving, so is P t . Also, since 
the first two operations on the right preserve the C p -norms, and since the conditional expectation is readily 
seen to be a contraction from C p (fC) to C P (Q) for each p, it is readily seen that P t possesses this property as 
well. 

To obtain a more familiar expression for P t , note that 

Rff(t) ° nQ( E [a u ...,a k ]) = e^ fe *-E[ Ql ,...,a fc ] + (terms annihilated by tt q ) . 
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Hence 

Pt(E [au ... iCtk] )=e- kt E [au ... iCtk] . (4.3) 

Evidently {P t : t > 0} is generated by Hq where Ho(E[ ai Qfc j) = kE[ ai Qfc j. It is easy to see that under 
the unitary equivalence between C 2 (Q) and fermion Fock space T described by Segal [Se56], Ho is equivalent 
to the usual number operator, or in other words, oscillator Hamiltonian on T . 

Our primary goal is to prove optimal hypercontractivity bounds for P t . That is, given 1 < p < q < oo 
we want to show that for some finite t, Pt is a contraction from C P (Q) to C q (Q), and to find the smallest 
such t. Let 

||P t || p ^=su P {||P t A|| g : \\A\\ P = 1} . (4.4) 

As a first reduction, we shall show that the suprcmum on the right in (4.4) can be restricted to the positive 
operators A with || A\\ p = 1. In the boson case this follows immediately from the fact that, in ordinary prob- 
ability theory, the absolute value of a conditional expectation is no greater than the conditional expectation 
of the absolute value. 

In general, matters concerning the absolute value in the non-commutative setting are more troublesome 
than in the commutative setting. An example is provided by the Araki-Yamagami inequality [ArYa] which, 
specialized to our context, asserts that the map A \— ► \A\ is Lipschitz continuous on C 2 {JC) with constant \[2 
instead of the constant 1 which we would have in the commutative setting. 

Thus while the conditional expectation in an operator algebra has many properties analogous to those 
of the conditional expectation in ordinary probability theory [Um54], it is not in general true that |7Tq(j4)| 
will be a smaller operator than 7Tq(|A|). The following theorem expresses a useful property in this direction 
which does hold, and after proving it we shall show by example that stronger properties do not hold. The 
theorem and its proof are easily extended to a more general von Neumann algebra setting by the methods 
in [Ru72]. 

Theorem 2: (A Schwarz inequality for conditional expectations). For all A in C(JC) and all p 
with 1 < p < oo, 

lkQ(^)l| P <lk Q (m)liy 2 |k Q (|A*|)||y 2 . (4.5) 



Remark: If we let F(A) denote ||7rgA|| p , then the same argument which we shall use to prove Theorem 2 
also establishes that 

F(A*B) < F{A*A) 1 / 2 F{B*B) 1 / 2 . (4.6) 

In this form, the term "Schwarz inequality", by which we referred to (4.5), is more evidently appropriate. 
Moreover, inequalities of the type (4.6) are well known in matrix analysis for many familiar functions; for 
example when F(A) is the determinant of A or the spectral radius of A. Further examples can be found in 
[MeDS]. 

In [Li76] it is shown that for a function F that satisfies (4.6), and which is monotone increasing; i.e. 
satisfies F(B) > F(A) for all B > A > 0, the following inequalities hold: 

m m m 

f(£a;Bj) < P(^(A*A j ) 1 /2 jF (^( jB ; B .)i/2 

and 

mm m 

fC£ a j) < p(E(i^i) 1/2 ^(E(i^i) 1/2 • 

i=i i=i i=i 

In particular, these inequalities hold for F(A) = ||7rgA||p. 

Specializing the last inequality to the case m = 1 then yields (4.5). In our present case however, the 
proof of the (4.6) is essentially the same as the direct proof of (4.5). Nonetheless, it should not be considered 
novel that by taking \ A*\ into consideration as well as \ A\, we can obtain a suitable bound on ||7TqA|| p . 
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Proof: Let A = U\A\ be the polar decomposition of A. Then ||7r g ( J 4)|| p = t(CU\A\) for some C in C(Q) 
with \\C\\ P > = 1. Let C = V\C\ be the polar decomposition of C. Both V and |C| belong to C(Q) as well. 
Thus 

|K S (A)|| P = riCUlA^Al 1 / 2 ) = rdCl^UlA^A^VlCl 1 / 2 

<T(|C| 1 /2 £ ;|^|i;*|qV2) 1 /2 T (| C ,|l/2 v .| i4 | y | C |l/2)V2 

= r(|q(L/|A|^)) 1/2 r((t/|qT/*)|A|) 1/2 < ||tt q (C/|^| C/*) 2 1| tt q (|^| ) ||V 2 
Finally, we note that U\A\U* = \A*\. O 



Example Let A be the matrix A 



\A\=V2 



1 
1 


1 



. Then 



and \A*\ 



V2 



1 1 
1 1 



Note that \A*\ is in C(Q), but A and \A\ are not. One easily finds 



1 1 
1 1 



k e ( A )lloo = i 



MI^I)lloo = 



1 o 
o 1 



V2 



1 1 
1 1 



and 11^(1^*1)1100 = ^ 



Theorem 3: (P t has positive maximizers). The norm of P t from C P (Q) to C q (Q) is achieved on the 
positive operators; i.e. 



= sup{||P t ,4|| 9 : A>0 and \\A\\ p = 1 } 



(4.7) 



Proof: Since Rg and ttq are both ^-automorphisms, \R$(t) ° ^g-^l — Re(t) ° ^gl^-l an d K-Re(t) ° ttq^)*! = 
R e(t) on* Q \A*\. Thus ||P t A||, < ||P t | A\ ||J /2 ||Pt|^| \\l /2 , and of course |||A*||| P = |||A||| P = ||A|| P . □ 

V. HYPERCONTRACTIVITY FOR FERMIONS 

Our main result is the following theorem which is established in this section. 

Theorem 4: (Optimal fermion hypercontractivity). For all 1 < p < q < oo, ||Pt|| p ^g = 1 exactly 
when 

e-*< P —± . 

q - 1 



The heart of the matter is the following lemma: 
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Lemma: For all 1 < p < 2, ||P t || p _ > 2 = 1 exactly when e 24 < (p — 1). 



Proof: Fix a positive element A of C(Q). Pick a basis {qi, . . . ,q n } of Q, and let C(Q(„_i)) denote the 
Clifford algebra associated with the span of the first n — 1 of these basis elements. It is evident from the 
form of our standard basis of C(Q) that A can be uniquely decomposed as A = B + CQ n where B and C 
belong to C(<2(„_!)). Then using the Jordan- Wigncr transform we can write A = B + CV n Q n . Now write 

H = H (n - 1) ®(E 2 (5.1) 

so that B, C and V n C can be considered as operators on the first factor ?i(„_i). Let u± = (e + ± e^)/^/2, 
so that Qu± = ±u±. Then if v is any vector in W(„_i), 

((v®u±),A(v®u±)) H = {v,(B±CV n )v) H(n _ 1) . (5.2) 

We see from this that since A > 0, so are both B + CV n > and B — CV n > 0. Now let Tri and Tr 2 denote 
the partial traces over the first and second factors in (5.1), so that with Tr still denoting the full trace, we 
have Tr = TriTr 2 - Now applying Theorem 1 in the special case which is proved in the appendix 

\\A\\ 2 p =[^-Tr\B + CV n Q n \ 

_ ( l y^f TnlB + CValP + TnlB-CVnlP 

^ (w^j) 2/P (^ B ^ 2/P + (p-wri\c\n 2,p ) 

since V n is unitary. 

Thus ||A|| 2 > ||i3|| 2 + (p- 1)||C|| 2 where the norms on the right are all norms on C(Q(„_!)). 
Now we make the inductive assumption that the lemma has been established for C(Q(„_!)). This is 
clearly the case when n = 1. We then have 

\\A\\l>\\P t B\\l + (p-l)\\P t C\\i 

where e~ 2 * = (p — 1). But clearly from (4.3) ||P t C<2„||2 = (p — 1 ) 1 1 C 1 1 1 where the norms are once again 
norms on C(Q). Moreover by (3.4) P t B and P t CQ n are orthogonal. Thus \\P t B\\l + (p - l)||P t C||i = 
||P(P + CQ„)|| 2 . = ||P tJ 4|| 2 . □ 

Theorem 5: (Optimal fermion logarithmic Sobolev inequality). For all A € C(Q), 

r(\A\ 2 \n\A\ 2 )-(\\A\\ 2 2 ln\\A\\l) <2(A,H A) . (5.3) 



Proof: By the lemma, ||P t ^4||| < ||^4|j 2 1+e -2t) and there is equality at t = 0. Both sides are continuously 
differentiable, and comparing derivatives at t — we obtain the result. Indeed, 

A||A|| p = l||A||^(r(|A|Hn|A|)-||^ln||A|| p ) (5.4) 

and of course 

^ll*U||l = ±(A,P 2t A)=-2(A,H A) .□ 
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Gross refers to the quadratic form on the right side of (5.3) as the Clifford Dirichlet form since it shares 
many properties of Dirichlet forms in the ordinary commutative setting. An approach to the development 
of a theory of Dirichlet forms in the non-commutative setting can be found in [A1HK] . 



Proof of Theorem 4: By a deep result of Gross, when A > and 1 < p < oo, 

(A p / 2 ,H A p / 2 ) < (P/^^HoAP- 1 ) . 
P- 1 

Replacing A in (5.3) by A p / 2 and using the inequality just quoted we obtain, following Gross's ideas [Gr75], 

r(A p \nA) \\A\\ p \n\\A\\ p < l/2-{A*-\ H A) . 

By combining this with (5.4) a differential inequality is obtained which implies that ||P t A|| g ( t ) is a decreasing 
function of t when q(t) = 1 + e 2t (p — 1). This establishes the result for A > 0, and by Theorem 3 it is 
established in general. By (4.3), PtI — I, and therefore ||P t |j p ^ g is always at least 1 for all p and q. That 
the inequality is best possible follows from a direct computation with one degree of freedom. To be precise, 
\\Pt(I + Qi)||q = \\I + e ~ t Qi\\q is easily computed and compared with ||7 + [Gr72]. The first quantity 

is greater than the second if e~ 2 * > (p — l)/(q — 1). □ 



VI. HYPERCONTRACTIVITY FOR BOSONS AND FERMIONS TOGETHER 



As a result of the present work and of earlier work on bosons, we know that for t given by e~ 2t = 
(p — l)/{q— 1), both the fermion and the boson oscillator semigroups are contractive from the appropriate 
p-spaces to the appropriate q-spaces, and that this value of t is optimal for each case separately. 

It is natural to expect that the same condition governs hypercontractivity in a situation in which we 
have bosons and fcrmions together. This is indeed the case, as we now show using Minkowski's inequality in 
an argument based on Segal's method for showing that the optimal conditions for hypercontractivity with 
m boson degrees of freedom are the same as for one degree of freedom. 

Let /j,(dx) = ( 2n)- m / 2 e' x ' 2 &x be the unit Gauss measure on IR m . Then in our mixed setting, with m 
boson degrees of freedom and n fermion degrees of freedom, the relevant p-space is 

BP = L?(JR m ^)®CP(Q {n) ), 

which may be regarded as consisting of C P (Q(„)) valued measurable functions x \— ► A(x) such that 

\\A\% = ( \\A{x)\\l^dx) 

is finite. This equation defines the norm on B p . For p = 2, B p is naturally isomorphic to the tensor product 
of the symmetric tensor algebra over (E m and the antisymmetric tensor algebra over (E n as shown by Segal. 
On the latter space we have the mixed oscillator semigroup generated by the sum of the boson and fermion 
number operators 



-t J2 + J2 c j c 3 } 



.?=i 

Considered as operators on B p , the operators P t which constitute this semigroup are given by 



T t A(x)= [ M t (x,x')P t [A(x')]fi(dx') (6.1) 

JR M 
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where M t (x,x') is the Mehler kernel; i.e., the positive integral kernel for the boson oscillator semigroup 
p(boson) (jjg^gggd m s ec tion IV. Of course P t denotes the fcrmion oscillator semigroup studied throughout 
this paper. 

Now successively applying Minkowski's inequality, our theorem on optimal fermion hypercontractivity, 
and Nelson's theorem on optimal boson hypercontractivity, we have for e~ 2 * < (p — l)/(q — 1): 



WtAM 



< 



< 



R" 



R" 



R" 



R" 



M t (x,x')P t [A(x')]n(dx') 



fi(dx) 



M t (x,x')\\P t A(x')\\ q ^dx')) n(dx) 



f (f M t (x,x')\\A(x')\\ P n(dx')) »(dx) 
< ( [ \\A(x)\\l„(dx)) q/p = \\A\H . 



APPENDIX 



Proof of Theorem 1 when A ± B > 0: Let Z and W be the 2m x 2m matrices given by 



Z = 



'A 


" 




B 





A 


, W = 


—B 



Our goal is to establish that for all r with < r < 1, 

(Tr{A + rB)>+Tr(A-rB)y<>> > {Tr{A)p)2/p + ^ _ 1)(Tr|fl|I , )a/l , 

or what is the same, 

Tr(Z + rW) 2/p >(Tr(Z)P) 2 ^ + r 2 (p-l)(Tr\W\ p ) 2/p . (Al) 

First, note that the null space of Z + rW is exactly the null space of Z for < r < 1. Thus by carrying 
out all of the following computations on the orthogonal complement of this fixed null space, we may freely 
assume that Z + rW > for all < r < 1. Next, both sides of (A.l) agree at r = 0, and the first derivatives 
in r of both sides vanish there as well. We define ip(r) to be Tr(Z + rW) p . Then the second derivative in r 
of the left side of (A.l) satisfies 



dr 2 



{4>{r)) VP > -TP(rY 2 - p >/ p - 



V>(r) 



p - dr 2 
The second derivative on the right side is just 

2(p- \){Tr\W\ p ) 2 / p , 

and we are left with showing that 

i^( r )(2-p)/pjL_^( r ) > ( p -l)(Tr\W\ p ) 2/p 



(A.2) 



for all < r < 1. By redefining Z to be Z + rW, it suffices to establish (A.2) at r = 0. 

Now 4ptp(r) = p(Tr(Z + rW)( p -^W), since A ± B > 0, Z + rW > for small r, and we can use the 
integral representation 



roo 

{Z + rW) (p - 1] =c p ifr" 1 ) 



1 



t t+(Z + rW) 



dt 
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to conclude that 

j2 roc 

Consider the right side as a function, f(Z), of Z for fixed W. It is easy to see that / is convex in 
Z. (Simply replace Z by Z + tX, with X self-adjoint, and then differentiate twice with respect to t; the 
positivity follows from the Schwarz inequality for traces.) Also, f(UZU*) = f(Z) provided U is unitary and 
U commutes with W. In a basis in which W is diagonal, we form the set U consisting of the 2 2m distinct 
diagonal unitary matrices, each with +1 or —1 in each diagonal entry. Each of these clearly commutes with 
W. Then 

f(Z) = 2~ 2m ]T f(UZU*) > f(2~ 2m ]T UZU*) = f(Z diag ), 
ueu u&a 

where Zd; ag is the matrix that is diagonal in the basis diagonalizing W, and whose diagonal entries are those 
of Z in this basis. 

Replacing Z by Zdi ag in (A. 3), the integration can be carried out, and we obtain 

j 2 /2m \ 

^(o)>p(P-i)(E^" a M) 

where zj and uij, respectively, denote the jth diagonal entries of Z and W in a basis diagonalizing W. 

Now consider ^(0) = Tr(Z p ) as a function of Z. It is clearly convex, and thus by the averaging method 
just employed, we obtain 

(2m 

3 = 1 

To establish (A. 2), we are only left with showing that 

/2m \ (2— p)/p /2m \ /2m \ 2/p 

but this follows immediately from Holder's inequality. 

To complete the proof, observe that equality in (A.l) for r = 1 and 1 < p < 2 implies equality in (A. 4) 
for almost every r in [0, 1]. Here, recall that Zj in (A. 4) really denotes the jth diagonal element of Z + rW; 
these are the numbers Zj + rwj, where Zj denotes the jth diagonal element of Z. 

Let us assume that Wj ^ for some j. Then equality in Holder's inequality (A. 4) requires that the 
vector with positive components Zj + rwj be proportional to the vector with components |tOj|. Thus, for 
almost every r in [0, 1] we require 

Zj+rwj = c(r)\wj\ 

for some number c(r) that depends on r but not on j. The left side above is a linear function, and thus 
c(r) = a + rb for some numbers a and b. But then clearly b — Wj/\wj\, and all non-zero eigenvalues of W 
would necessarily have the same sign. This is impossible since TrW = 0. □ 



-W- 



t + Z t + Z 



-W 



dt 



(A3) 




We now give an application of the uniform convexity implied by this theorem to the differentiability of 
the C P (JC) norms. First we recall that for 2 < p < oo, the modulus of convexity is given by an analog of an 
inequality of Clarkson for integrals which Dixmier [Di53] established for traces. Specializing to C P (JC), this 
inequality reads 

p L 

which implies that in this range (cf. (3.7)) 



A + B 


p 


A — B 




+ 




2 


p 


2 



II? + 



B\\ p p ] 



2 < p < oo 



(A5) 



15 



For any non-zero A in C P (JC), define T>(A) by 

V{A) = \\A\\£- p ^\A\( p -Vu* 

where A = U\A\ is the polar decomposition of A. Let p' be defined by l/p+ 1/p' = 1. Then for 1 < p < oo, 
||2>(A)|| p , = 1 and t(D(A)A) = \\A\\ p . Moreover, V(A) is the unique element of C p '(/C) with this property. 
We call the map A £>(^4) the gradient map on C P (JC). The next theorem sharpens a result of Gross 
[Gr75]. 

Theorem 6: (Holder continuity of the gradient map on C p (/C)). For all 1 < p < oo, the gradient 
map is norm continuous. Moreover, 



,\ \A-B\\ ' 
' \A + B\ 



\\V(A)-V(B)\\ p ,<2(p "; [A i p ' ) /or 1< p < 2 , (A6) 



and 

||X>(A)-P(g)|| p ,<4(p-l)( ||^~^ ) /or _>- ; x . (,1.7) 



Proof: First observe that for all 1 < p < oo 

\\V(A) + V(B)\\ P , \\A + B\\ P > 3tr((V(A) + V(B))(A + B)) 

= 2(\\A\\ p + \\B\\ p ) nr((V(A) V(B)){A B)) 
>2(||A + B|| P )-||X>(^)-2?(B)|| P ,||A-B|| 1 , . 

Thus, 

V{A) + V(B) 
2 

But, by (A. 5) and (3.7), we have 

2?(A)+2?(g) 
2 

for 1 < p < 2. By combining (A. 9) and (A. 8) we obtain (A. 6). Similarly, by combining (3.9) with (A. 8) we 
obtain (A. 7). □ 

The continuity of the gradient map for C P (JC) has been established by Gross [Gr72], but his proof is more 
involved and does not yield an estimate of the modulus of continuity. It is now easy to establish continuous 
differentiability of the C P (JC) norms away from the origin since, with h{t) = \\A + tB\\ p , with A different from 
and with t and s sufficiently small, we have 

?Rt(V(A + tB)B) < k ^ + h ^ < Kt(V{A + {t + s)B)B) (A10) 

when s is positive. To sec this, observe that h(t) = 3tT(V(A + tB)(A + tB)), and that h(t + s) > ^t(V(A + 
tB)(A + (t + s)£?)) by Holder's inequality. By subtracting the expression for h(t) from the estimate for 
h(t + s) and dividing by s, we obtain the inequality on the left in (A. 10). The inequality on the right is 



< 



V(A) - V(B) 



U-B\\ 
U + B\\ 



(A8) 



> — 



V(A) - V{B) 



(A9) 
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obtained in an analogous manner. When s is negative, the inequalities are clearly reversed. Letting s tend 
to zero, we obtain 

d '\A + tB\\ p =^t(V(A)B) . 
t=o 



dt 
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